Serveur d'exploration sur SGML

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Clustering of XML documents

Identifieur interne : 001964 ( Main/Exploration ); précédent : 001963; suivant : 001965

Clustering of XML documents

Auteurs : Damien Guillaume [États-Unis] ; Fionn Murtagh [Royaume-Uni]

Source :

RBID : ISTEX:D3AA01AB63527FA108046A9B121443AEFF4BA385

English descriptors

Abstract

Abstract: Self-organization or clustering of data objects can be a powerful aid towards knowledge discovery in distributed databases. The web presents opportunities for such clustering of documents and other data objects. This potential will be even more pronounced when XML becomes widely used over the next few years. Based on clustering of XML links, we explore a visualization approach for discovering knowledge on the web.

Url:
DOI: 10.1016/S0010-4655(99)00511-1


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title>Clustering of XML documents</title>
<author>
<name sortKey="Guillaume, Damien" sort="Guillaume, Damien" uniqKey="Guillaume D" first="Damien" last="Guillaume">Damien Guillaume</name>
</author>
<author>
<name sortKey="Murtagh, Fionn" sort="Murtagh, Fionn" uniqKey="Murtagh F" first="Fionn" last="Murtagh">Fionn Murtagh</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:D3AA01AB63527FA108046A9B121443AEFF4BA385</idno>
<date when="2000" year="2000">2000</date>
<idno type="doi">10.1016/S0010-4655(99)00511-1</idno>
<idno type="url">https://api.istex.fr/ark:/67375/6H6-ZRKLL3V8-9/fulltext.pdf</idno>
<idno type="wicri:Area/Istex/Corpus">003671</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Corpus" wicri:corpus="ISTEX">003671</idno>
<idno type="wicri:Area/Istex/Curation">002A80</idno>
<idno type="wicri:Area/Istex/Checkpoint">001793</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">001793</idno>
<idno type="wicri:doubleKey">0010-4655:2000:Guillaume D:clustering:of:xml</idno>
<idno type="wicri:Area/Main/Merge">001A08</idno>
<idno type="wicri:Area/Main/Curation">001964</idno>
<idno type="wicri:Area/Main/Exploration">001964</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a">Clustering of XML documents</title>
<author>
<name sortKey="Guillaume, Damien" sort="Guillaume, Damien" uniqKey="Guillaume D" first="Damien" last="Guillaume">Damien Guillaume</name>
<affiliation wicri:level="2">
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>National Center for Supercomputing Applications, Astronomy Department, University of Illinois at Urbana-Champaign, 1002 West Green Street, Urbana, IL 61801</wicri:regionArea>
<placeName>
<region type="state">Illinois</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Murtagh, Fionn" sort="Murtagh, Fionn" uniqKey="Murtagh F" first="Fionn" last="Murtagh">Fionn Murtagh</name>
<affiliation wicri:level="1">
<country wicri:rule="url">Royaume-Uni</country>
</affiliation>
<affiliation></affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Computer Physics Communications</title>
<title level="j" type="abbrev">COMPHY</title>
<idno type="ISSN">0010-4655</idno>
<imprint>
<publisher>ELSEVIER</publisher>
<date type="published" when="2000">2000</date>
<biblScope unit="volume">127</biblScope>
<biblScope unit="issue">2–3</biblScope>
<biblScope unit="page" from="215">215</biblScope>
<biblScope unit="page" to="227">227</biblScope>
</imprint>
<idno type="ISSN">0010-4655</idno>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0010-4655</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="Teeft" xml:lang="en">
<term>Algorithm</term>
<term>Astronomical markup language</term>
<term>Astronomical object</term>
<term>Astronomical object names</term>
<term>Astronomical objects</term>
<term>Average number</term>
<term>Balanced clusters</term>
<term>Best keyword</term>
<term>Best score</term>
<term>Cluster</term>
<term>Collaboration graph</term>
<term>Cost function</term>
<term>Current_solution</term>
<term>Damien guillaume</term>
<term>Database</term>
<term>Document</term>
<term>Elsevier science</term>
<term>Euclidean distance</term>
<term>Extensible markup language</term>
<term>Goto step</term>
<term>Graph partitioning</term>
<term>Graph partitioning algorithm</term>
<term>Graph partitioning problem</term>
<term>Grid</term>
<term>Guillaume</term>
<term>Html</term>
<term>Html documents</term>
<term>Ident</term>
<term>Indexing vocabulary</term>
<term>Information retrieval</term>
<term>Initial solution</term>
<term>Java applet</term>
<term>Jean heyvaerts</term>
<term>Keyword</term>
<term>Keyword links</term>
<term>Keywords</term>
<term>Knowledge discovery</term>
<term>Links</term>
<term>Local minima problem</term>
<term>Many articles</term>
<term>Many documents</term>
<term>Markup</term>
<term>Markup language</term>
<term>Murtagh</term>
<term>Murtagh computer physics communications</term>
<term>Node</term>
<term>Noising</term>
<term>Noising algorithm</term>
<term>Noising method</term>
<term>Noising rate</term>
<term>Original data</term>
<term>Other clusters</term>
<term>Other documents</term>
<term>Partitioning</term>
<term>Partitioning algorithm</term>
<term>Regular grid</term>
<term>Same cluster</term>
<term>Same keyword</term>
<term>Same number</term>
<term>Same time</term>
<term>Server</term>
<term>Similar documents</term>
<term>Total number</term>
<term>User</term>
<term>User clicks</term>
<term>User interface</term>
<term>Xlink</term>
</keywords>
</textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: Self-organization or clustering of data objects can be a powerful aid towards knowledge discovery in distributed databases. The web presents opportunities for such clustering of documents and other data objects. This potential will be even more pronounced when XML becomes widely used over the next few years. Based on clustering of XML links, we explore a visualization approach for discovering knowledge on the web.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Royaume-Uni</li>
<li>États-Unis</li>
</country>
<region>
<li>Illinois</li>
</region>
</list>
<tree>
<country name="États-Unis">
<region name="Illinois">
<name sortKey="Guillaume, Damien" sort="Guillaume, Damien" uniqKey="Guillaume D" first="Damien" last="Guillaume">Damien Guillaume</name>
</region>
</country>
<country name="Royaume-Uni">
<noRegion>
<name sortKey="Murtagh, Fionn" sort="Murtagh, Fionn" uniqKey="Murtagh F" first="Fionn" last="Murtagh">Fionn Murtagh</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Informatique/explor/SgmlV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001964 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001964 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Informatique
   |area=    SgmlV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:D3AA01AB63527FA108046A9B121443AEFF4BA385
   |texte=   Clustering of XML documents
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jul 1 14:26:08 2019. Site generation: Wed Apr 28 21:40:44 2021